Primers and a screening method for identification of artemisinin 

producing plants 



Field of the present Invention 

5 The present invention relates to a pair of primers with forward primer of SEQ ID NO. 1 
having sequence of CCAAGCTTGCTGAACGCATCGG, and reverse primer of SEQ ID 
No. 2 having sequence of CCAAGCTTGCCACGCAGGATTATC, and a screening 
method for early identification of plants Artemisia annua having high content of 
artemisinin and thereby helping generation of plant population with further high content 
10 of artemisinin. 

Background and prior art references of the invention 

The plant Artemisia annua (family: Asteraceae) produces a sesqui terpenoid lactone 
endoperoxide named artemisinin which is a promising antimalarial drug effective against 
Plasmodium falciparum, Plasmodium vivax at nanomolar concentration. Artemisinins are 

15 active against Schistosoma mansoni and S. japonicum in-vitro and in-vivo in experiments 
in animals. These schistosomes, like malarial parasites, degrade haemoglobin and 
produce hemozoin. These compounds are also active against Leishmania major, 
Toxoplasma gondii and Pnenmocystic carinii in-vitro and against P. carinii in-vivo. 
Artemisinins have immunosuppressive activity and also potential anticancer activity. For 

20 these activities, the doses of artemisinin required are substantially higher than the dose 
for antimalarial activities. According to Meshnick et at., (1996) (Microbiological 
Reviews 6:301-315) the antimalarial endoperoxides including artemisinin, 
dihydroartemisinin and arteethers, are not likely to be useful for other therapeutic 
purposes except against malarial parasites. 
' 25 Although artemisinin rapidly suppresses the activity of parasites like Plasmodium vivax 
and P. falciparum, problems with high rate of recrudescence (>10% recrudescence 
infections), short half life persist. Hence, there is a need to develop new drugs against 
quinolone resistant pathogenic bacteria. It is a known fact that clinically used 
antibacterial broad spectrum compounds such as quinolones which exhibit DNA gyrase 

30 activity of Mycobacterium sp. (causing tuberculosis), Haemophilus sp. and Haemophilus 
influenzae are gradually becoming ineffective due to the occurrence of mutatious in 
gyrase genes and their natural selection under continuous use of such drug. 
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The compound a arteether developed as antimalarial drugs by Central Drug Research 
Institute (CDR1), Lucknow, India and Central Institute of Medicinal & Aromatic Plants 
(CIMAP), Lucknow, India, after phase II clinical trial is a stable derivative of 
artemisinin. Earlier we have found a novel property of a -arteether as being effective 
5 against the gyr A mutant strains of E. coli but ineffective against wild type strains (US 
patent 6,127,405). Also we have developed a strategic and novel composition comprising 
a arteether and nalidixic acid or quinolone drugs which is useful as an advanced 
generation drug to counter the resistance development itself and having a potential to be 
used in treating infectious diseases and in inhibiting the resistance developed due to 
10 mutation in the gyr A gene of bacteria, particularly in those cases where drug resistant 
strains are known to appear very frequently (US patent 6,42,3741). 

In an earlier invention a method was also provided for maximization of artemisinin yield 
of the plant Artemisia annua, said method comprising sowing seeds of Artemisia annua 
plant on raised bed nursery during second and third week of December and maintaining 

15 the moisture throughout; transplanting seedlings thus obtained bearing at least 5-15 
leaves into the main field fertilized with fertilizer, preferably NPK @ 80,40,40 kg/ha to 
achieve a population density of 50,000 to 200,000 per ha followed by light irrigation in 
the second week of March and irrigation every fortnight thereafter; harvesting the crop 
four times by cutting the plant tops leaving 75-100 cm part of plant for further 

20 regeneration, the said harvests are performed in a manner that the first harvest is done in 
fourth week of May, second harvest in third week of July, third harvest in second week of 
September and fourth harvest in third week of October of each year; and at each 
harvesting time care is taken to care at least one green branch, and extracting artemisinin 
from the plant tissue immediately after each harvest. (US patent 6,393,763). 

25 Considering the high value of the chemical artemisinin for use in derivatization to 
different semisynthetic product of immence importance the need of the hour is to still 
increase the yield. Agronomic practices and scheduling of the harvest timings to obtain 
higher biomass yield do not take into account the genotypes as all the plants are harvested 
together. Since the plant Artemisia annua is highly cross pollinated like the members of 

30 family Asteraceae the chemical character like 'artemisinin content' seggregate like any 
other phenotypic characters as multigenic characters always segregate in the progeny 
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population. Due to this all the progeny plants of the high artemisinin containing plant 
may not yield same amount of the chemical. Some will be high, some medium and some 
very low. 

Considering the problem of identification of the high artemisinin containing plant 
5 genotypes at the nursery stage, to discard the low artemisin genotypes for the purpose of 
planting only those genotypes which could produce high amount of artemisinin during 
maturity in the main field a systematic approach for identification of DNA marker was 
launched. In this process the marker was identified which could differentiate the high 
artemisinin genotypes from low artemisinin genotypes at the seedling stage itself. These 
10 selected seedlings showing the presence of the DNA marker then could be taken for 
further matting between them to generate plants produce highest biomass as well as 
higher artemisinin. 
Objects of the present invention 

The main object of the present invention is to develop a pair of primers capable of 
15 identifying plants Artemisia annua containing high content of artemisinin. 

Another object of the present invention is to develop a screening method for early 

identification of plants Artemisia annua having high content of artemisinin. 

In yet another object of the present invention is to develop a method for generation of 

plant population with further high content of artemisinin. 
20 Still another object of the invention is to develop a method for plants to be identified at 

nursery stage itself having high content of artemisinin. 

Still another object of the present invention is to develop a method to identify plants 
having high content of artemisinin ranging between 0.5 to 1 .4 w/w%. 
Summary of the present invention 

25 The present invention relates to a pair of primers with forward primer of SEQ ID NO. 1 
having sequence of CCAAGCTTGCTGAACGCATCGG, and reverse primer of SEQ ID 
No. 2 having sequence of CCAAGCTTGCCACGCAGGATTATC, and a screening 
method for early identification of plants Artemisia annua having high content of 
artemisinin and thereby helping generation of plant population with further high content 

30 of artemisinin. 
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Detailed description of the present invention 

The present invention relates to a pair of primers with forward primer of SEQ ID NO. 1 
having sequence of CCAAGCTTGCTGAACGCATCGG, and reverse primer of SEQ ID 
No. 2 having sequence of CCAAGCTTGCCACGCAGGATTATC, and a screening 
, 5 method for early identification of plants Artemisia annua having high content of 
artemisinin and thereby helping generation of plant population with further high content 
of artemisinin. 

In still another embodiment of the present invention, wherein a pair of primers with 
forward primer of SEQ ID NO. 1 having sequence of 
10 CCAAGCTTGCTGAACGCATCGG, and reverse primer of SEQ ID No. 2 having 
sequence of CCAAGCTTGCCACGCAGGATTATC. 

In still another embodiment of the present invention, wherein A pair of primers as 
claimed in claim 1, wherein the primers help identify plants Artemisia annua containing 
high content of artemisinin. 
15 In still another embodiment of the present invention, wherein A screening method for 
early identification of plants Artemisia annua having high content of artemisinin and 
thereby helping generation of plant population with further high content of artemisinin, 
said method comprising steps of: 

• isolating DNA from the plant, 

20 • running PCR on the isolated DNA using a pair of primers of SEQ ID Nos. 1 and 

2, 

• identifying plants having high content of artemisinin, containing nucleotide SEQ 
ID No. 3, and 

• crossing the identified plants to produce the next generation plants with further 
25 higher content of artemisinin. 

In still another embodiment of the present invention, wherein the plants can be identified 
at nursery stage itself. 

In still another embodiment of the present invention, wherein the high content refers to 
concentration of 0.4 w/w/% or more. 
30 In still another embodiment of the present invention, wherein the plant with higher 
content of artemisinin ranging between 0.5 to 1 .4 w/w% are produced. 
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In still another embodiment of the present invention, wherein the increase in the 
artemisinin genetic advance (GA) is about 0.4 w/w % in first four years. 
In still another embodiment of the present invention, wherein the artemisinin content 
heritabilily (h) is about 80. 
5 In still another embodiment of the present invention, wherein the method helps maintains 
elite genotypic population. 

In still another embodiment of the present invention, wherein the invention relates to a 
method of developing a specific DNA marker in form of a Sequence Characterized 
Amplified Region (SCAR) marker genetically tagged to the high artemisinin synthesizing 

10 genotypes of the plant Artemisia annua. Further the invention also describes the complete 
unique DNA sequence present in the high artemisinin containing plants. The DNA primer 
sequence usable for PGR amplification of tagged DNA of the plant containing high 
artemisinin. Also this invention illustrates the method to screen out the low artemisinin 
containing genotypes of Artemisia annua at the nursery stage itself using the DNA 

15 primers, to grow only the high artemisinin yielding genotypes to maturity for obtaining 
better harvest of artemisinin. Further the invention describes the method of developing 
high artemisinin containing plants of Artemisia annua using the SCAR marker through 
marker-assisted breeding. The invention provides the use of DNA technology to maintain 
elite genotypic population in a cross-pollinated plant species. 

20 The research on genome analysis is being taken up as a necessity to understand the 
genomic constitution of individuals in terms of DNA content, nature and variations etc. 
The data from gnome analysis are of direct relevance to molecular plant breeding in 
which morphological characters can be tagged to unique DNA sequences and then 
inheritance patterns of DNA markers can be utilized to confirm the presence of traits 

25 even before expression. Techniques are available to differentiate even similar looking 
individuals of a population on the basis of DNA sequence variation. Some recent 
important discoveries from application point of view towards genetic analysis include 
Restriction endonucleases mapping and Polymerised Chain Reaction for amplifying 
DNA sequences from traces. These discoveries have led to the means and techniques 

30 used to study the differences or uniqueness in the DNA sequences otherwise known as 
Polymorphism in the DNA. The tools like RAPD, AFLP, RFLP, SCAR, micro-satellite 
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and many others were invented earlier and used in literature extensively for 
differentiating and marking the plants for different characters. 

Development of marker correlating to high content of Artemisinin in the plant Artemisia 
annua. 

5 Selection of genotypes 

The seeds of ten chemotypic accessions of the plant Artemisia annua were collected from 
Kashmir and further studies were carried out in Lucknow field station. Seeds of A. annua 
were sown in pots with mixture of soil and FYM (farmyard manure) in the ratio of 1:1 
and germination in glass house conditions during the month of November of the years 

10 1998 - 2001. The seedlings having 10 cm height were transplanted with spacing 50cm 
between rows and 30cm between plants. The soil of experimental field was sandy loam in 
texture and neutral in reaction (pH 7.6). The plots were fertilized with FYM (Farm yard 
manure) @20 kg\ha before transplanting for obtaining optimum performance. Plots were 
prepared 3mx3m size with irrigation channels. For Hybridization, six seed lots were 

15 selected out of 10 seedlots (obtained from Kashmir in the year 1998) were transplanted in 
alternative rows. From the next year (1999) on wards the progeny seedlings of the 
chemotypically selected plants were planted again in alternate rows. All the seedlings 
were checked for artemisinin content after extraction. About O.lg dry powdered plant 
material was extracted in 10 ml of hexane by heating at 60°C for 3 minutes and left for 

20 overnight at room temperature. Then extract was filtered and evaporated on water bath at 
50°C. After evaporation extract was dissolved in 1ml hexane and used in TLC. Properly 
(20 x20 cm E-MEREK) dissolved extract was spotted in TLC plates at 1cm apart along 
with standard (lmg\ml). Spotted TLC plate was dipped in solvent (mobile phase) 
Hexane:Diethyl ether (1:1) Plate was- dried in air and dipped in developing reagent 

25 Glacial acetic acidxonc. Sulphuric acid:Anisaldehyde (50:1: 0.5ml) and heated at 120°C 
for 10- 15 minutes and then Stabilized and scanned (540nm, visible) (Densitometer 
CAMAG :Switerzerland). The TLC plates were scanned and the artemisinin content of 
individual progeny plants were quantified. From the analysis the plants producing 
trace(0.10% or less) artemisinin and the plants producing more than 0.4% artemisinin 

30 were selected and finally 10 plants from each category were taken for DNA analysis. 
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DNA isolation and PGR amplifications 

DNA was isolated from the leaf tissue essentially according to the protocol described 
earlier (Khanuja SPS, Shasany AK, Darokar MP, Sushil Kumar (1999) Rapid Isolation 
of PCR Amplifiable DNA from the Dry and Fresh Samples of Plants Producing Large 
5 Amounts of Secondary Metabolites and Essential oils by Modified CTAB Procedure. 
Plant Molecular Biology Reporter, 17, 74.). Polymerase chain reactions (PCRs) were 
carried out in 25 |al volume. A reaction tube contained 25 ng of DNA, 0.2 unit of Taq 
DNA polymerase, 100 |aM of each dNTPs, 1.5 mM MgCb and 5 pmol of decanucleotide 
primers. The amplifications were carried out using the DNA Engine thermal cycler (MJ 

10 Research, USA) using 94°C, 35°C and 72°C temperatures for 40 cycles (Khanuja SPS, 
Shasany AK, Srivastava A, Sushil Kumar (2000). Assessment of genetic relationships in 
Mentha species. Euphytica ,111, 121-125.). The amplified products were separated on 
1.2% agarose gel containing 0.5 ^ig ml' 1 of ethidium bromide and photographed with 
Image master VDS (Pharmacia). The bands were analyzed using Image master ID elite 

15 software and the graphic phenogram of the genetic relatedness among the accessions was 
produced by means of UPGMA (unweighted pair group method with arithmetic average) 
cluster analysis. Custom-made decanucleotide primers were synthesised in the laboratory 
on Applied Biosystems 392 DNA-RNA Synthesizer and were designated as MAPOl to 
MAP20. The sequences of the primers M AP01 to M AP20 were: 

20 AAATCGGAGC, GTCCTACTCG, GTCCTTAGCG, TGCGCGATCG, 
AACGTACGCG, GCACGCCGGA, CACCCTGCGC, CTATCGCCGC, 
CGGGATCCGC, GCGAATTCCG, CCCTGCAGGC, CCAAGCTTGC, 
GTGCAATGAG, AGGATACGTG, AAGATAGCGG, GGATCTGAAC, 
TTGTCTCAGG, CATCCCGAAC, GGACTCCACG, AGCCTGACGC, respectively. 

25 The other sets of primers used included kit J, O and T, each consisting of 20 random 
decamer primers, procured from Operon Technologies Inc., USA. 

All the RAPD profiles thus generated were analyzed for bands always appearing with all 
the high artemisinin containing genotypes (more than 0.4%) and absent in the genotypes 
containing trace or no artemisinin. We could detect a band at approximately 850 base 
30 pair region amplified with the primer S^AACC™^* (MAP 12, Sequence ID 1) 
which consistently showed its presence in the genotypes containing more than 0.4% 
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artemisinin and absent in the genotypes with trace or no artemisinin. This finding was 
interesting considering the complex nature of the artemisinin biosynthetic pathway. For 
all other primers the amplified products showed variable positions in these genotypes and 
could not be correlated. 

5 The presence of the band in the segregating populations having high artemisinin could be 
ascertained as the samples of 10 analyzed plants having high artemisinin were drawn 
from different populations. Similarly, the sample of 10 plants for trace or no artemisinin 
drawn from different populations could show always the absence of the band. As all the 
plants analyzed were from the same initial population the genes for artemisinin 
10 biosynthesis were assumed to be normal. So the presence and absence of the band could 
be correlated to the regulatory function associated with the expression of some of the 
genes associated with the biosynthetic pathway. But certainly the DNA band of about 850 
base pair size could be con-elated with the biosynthesis of more than 0.4% artemisinin in 
Artemisia annua. 

15 In the next steps the DN A fragment described earlier was eluted out from the agarose gel 
and (since the fragment was amplified with the primer containing Hind HI restriction site) 
restricted with Hind III restriction enzyme (Recognition and restriction site 
5 f AAGCTT3'). Similarly, pBluescript II SK(+) procured from Stratagene Inc. was used to 
clone the fragment at the Hind III site using T4 DNA ligase enzyme available 

20 commercially. Escherichia coli strain DH5a, procured from Stratagene Inc again was 
transformed with this constructed plasmid and transformed cells were isolated on agar 
plates containing nutrient broth and ampicillin. All the experiments were performed 
according to the protocol Sambrook et al (1988). This fragment was sequenced 
completely with the help of Ml 3 forward and T3reverse primer (the sequence sites are 

25 present in the plasmid pBluescript II SK(+) and the nucleotide sequence is given below of 
SEQ ID NO. 3. 

AAGCTTGCTG AACGCATCGG TGTTACTGCC GCAGCCCGTG AACTCAGCCT 
GTATGAATCA CAACTCTACA ACTGGCGCAG TAAACAGCAA AATCAGCAGA 
CGTCTTCTGA ACGTGAACTG GAGATGTCTA CCGAGATTGC ACGTCTCAAA 
30 CGCCAGCTGG CAGAACGGGA TGAAGAGCTG GCTATCCTCC AAAAGGCCGC 
GACATACTTC GCGAAGCGCC TGAAATGAAG TATGTCTTTA TTGAAAAACA 
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TCAGGCTGAG TTCAGCATCA AAGCAATGTG CCGCGTGCTC CGGGTGGCCC 
GCAGCGGCTG GTATACGTGG GTGTCAGCGG CGGACAAGGA TAAGCCCGCG 
TAAGCAGTTC CGCCAACACT GCACAGGGGG TTGTCTCGCG GGTTTTACCC 
CGGGTCAAAC AAGCGTTACC GGTGCCCCAC GCTTGACCGG ATGACCTGCG 
5 GTGCTCAGGG TTACCCTTTA ACGTAAAAAA CCCGTGGCGG CAAGCTTGCC 
CGGTCAGGGA CTGAAGGCAA AGGCCTCCCG GAAGTTCAGC CCGGTCAGCT 
ACCGCGGCAC ACGGGCCTGC CTGTGTCAGA AAATCTGTTG GAGCAGGATT 
TTTACGCCCA GTGGCCCGAA CCAGAAGTGG GCAGGAGACA TCACGTACTT 
ACGTACAGAT GAAGGCTGGC TGTATCTGGC AGTGGTCATT GACCTGTGGT 
10 CACGTGCCGT TATTGGCTGG TCAATGTCGC CACGCATGAC GGCGCAACTG 
GCCTGCGATG CCCTGCAGAT GGCGCTGTGG CGGCGTAAGA GGCCCCGGAA 
CGTTATCGTT CACACGGACC GTGGAGGCCA GTACTGTTCA GCAGATTATC 
AGGCGCAACT GAAGCGGCAT AATCTGCGTG GAAGTATGAG CGCAAAAGGT 
TGCTGCTACG ATAATGCCTG CGTGGAAAGC TT 
_ 15 . Based on the sequence at the ends forward and reverse primers were synthesized with the 
sequence 

Forward Primer 5'CCAAGCTTGCTGAACGCATCGG3' (SEQ ID NO. 1) 

Reverse primer 5'CCAAGCTTGCCACGCAGGCATTATC3' (SEQ ID NO. 2) 

These sequences were used to amplify the genomic DNA of Artemisia annua (both high 

20 content of artemisinin and low content of artemisinin). The plant genomic DNA with 
high artemisinin content could generate a band of 936 bp where as in plants containing 
low amount of artemisinin the absence of the band was conspicuous. 
Use of the marker to generate a population of plants with high artemisinin content. 
In the first year polycross nursery was designed with alternate male and female line 

25 choosen among the seedlots. These plants were randomly picked up from the nursery 
raised from the 6 selected seed lots. The plants, which were designated as female (270 
plants), were analyzed for artemisinin content, which were selected for further 
experimentation. Seed sample were collected from these selected plants (13 in number) 
containing high amount of artemisinin(0.15 to 0.20%) and planted again in a polycross 

30 nursery in the second year. Next year 180 plants were analyzed for artemisinin content 
and 13 plants containing 0.45 to 0.50% artemisinin were selected for planting in the third 
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year. At this point 10 plants with more than 0.4% artemisinin and 10 plants containing 
trace amount artemisinin were taken for DNA isolation to develop SCAR marker as 
described previously. The SCAR marker was used to select plants from the nursery raised 
from the seeds selected 13 seedlots, and 12 plants from each seedlots showing the 
presence of SCAR marker were selected for random crossing among the plants in the 
third year. Randomly plants were analyzed for artemisinin content and among 150 plants 
analyzed 20 plants having artemisinin 0.8 to 1.0% were selected for next year (fourth 
year) planting. The seeds from these plants were grown in the nursery and 12 SCAR 
positive plants from each seed lot were grown randomly to facilitate cross pollination. 
From these 200 plants were analysed for artemisinin content and 11 plants were selected 
having 1.0 to 1.16% artemisinin content. Simultaneously, increase in the mean 
artemisinin content of the plants analysed every year were calculated. 

Flow sheet 1. 
Six seed lot selected based 

On Artemisinin content 
(more than 0.1%) 

Planted in alternate row 

i 

270 plants were analyzed for 
Artemisinin 

13 plants were selected) 
(0.12-0.2%) 

seeds II nd year 

6 plots 1 80 plants analyzed 

13 plants selected 0.45-0.50% 

1 

seeds grown in nursery III rd year 
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12 plants each seed lot Transplanted having SCAR marker 

i 

Analysis for SCAR marker and artemisinin content 

• i 

20 SCAR positive 0.8- 1% artemisinin containing plants selected 

i 

Seeds grown in nursery IV th year 

i 

1 2 plants of each positive for SCAR marker in three replication were transplanted 

i 

11 plants having 1-1.16 % 

Genetic advancement 

The key metabolite synthesis (Artemisinin content) was studied for genetic advancement 
which showed an upward trend beginning with 17.33% increase in the mean artemisinin 
content for the first year, crossing 50% in second year, 60% in third year and remaining 
at 42.06% in fourth year. The advancement in artemisinin content was calculated as per 
Singh and Chaudhary (1977) (Singh R K and Chaudhary B D (1977). Biomedical 
methods in quantitative genetic analysis. Kalyani Publications, New Delhi. 
GA : Genetic advance = ix h 2 x op 

i = i is the standardized selection differential 

ap = phenotypic standard deviation 

h 2 = heritability 

Table 



S.No. 


Traits 


I year 


11 year 


III year 


IV year 


1 


Artemisinin GA 


0.013 


0.167 


0.36 


0.39 




Content(%) GA(% of x) 


17.33 


53.87 


60.00 


52.06 




Mb) 


18 


52 


76 


70 
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One of the object of the present invention was to develop a marker system to be used in 
breeding Artemisia annua plant for high artemisinin content. The marker was developed 
and described as the unique sequence which appear in plants containing high amount of 
artemisinin (0.4% or more). The Indian genotypes of Artemisia annua found in wild have 
low content of artemisinin. This marker system can distinguish plant tending .to 
synthesize high amount of artemisinin when the biosynthetic system of the plant is 
functional (i.e. structural genes). Other objective of the invention was to generate a 
breeding and selection method using the marker assisted breeding to increase the content 
of artemisinin in the plants. From 0.1% artemisinin content, the plants were improved to 
1 .2% using the protocol and the marker system. 
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